skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Guo, Hao"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Free, publicly-accessible full text available April 13, 2026
  2. Abstract BackgroundWhile most health-care providers now use electronic health records (EHRs) to document clinical care, many still treat them as digital versions of paper records. As a result, documentation often remains unstructured, with free-text entries in progress notes. This limits the potential for secondary use and analysis, as machine-learning and data analysis algorithms are more effective with structured data. ObjectiveThis study aims to use advanced artificial intelligence (AI) and natural language processing (NLP) techniques to improve diagnostic information extraction from clinical notes in a periodontal use case. By automating this process, the study seeks to reduce missing data in dental records and minimize the need for extensive manual annotation, a long-standing barrier to widespread NLP deployment in dental data extraction. Materials and MethodsThis research utilizes large language models (LLMs), specifically Generative Pretrained Transformer 4, to generate synthetic medical notes for fine-tuning a RoBERTa model. This model was trained to better interpret and process dental language, with particular attention to periodontal diagnoses. Model performance was evaluated by manually reviewing 360 clinical notes randomly selected from each of the participating site’s dataset. ResultsThe results demonstrated high accuracy of periodontal diagnosis data extraction, with the sites 1 and 2 achieving a weighted average score of 0.97-0.98. This performance held for all dimensions of periodontal diagnosis in terms of stage, grade, and extent. DiscussionSynthetic data effectively reduced manual annotation needs while preserving model quality. Generalizability across institutions suggests viability for broader adoption, though future work is needed to improve contextual understanding. ConclusionThe study highlights the potential transformative impact of AI and NLP on health-care research. Most clinical documentation (40%-80%) is free text. Scaling our method could enhance clinical data reuse. 
    more » « less
    Free, publicly-accessible full text available May 2, 2026
  3. Free, publicly-accessible full text available January 1, 2026
  4. Ice-nucleating proteins (INPs) catalyze ice formation at high subzero temperatures, with major biological and environmental implications. While bacterial INPs have been structurally characterized, their counterparts in other organisms remain unknown. Here, we identify a new class of efficient INPs in fungi. These proteins are membrane-free, adopt β-solenoid folds, and multimerize to form large ice-binding surfaces, showing mechanistic parallels with bacterial INPs. Structural modeling, sequence analysis, and functional assays show they are encoded by orthologs of the bacterial InaZ gene, likely acquired via horizontal gene transfer. Our results demonstrate that distinct lineages have independently converged on a common molecular strategy to overcome the energetic barriers of ice formation. The discovery of cell-free INPs provides tools for freezing applications and reveals biophysical constraints on nucleation across life. 
    more » « less
    Free, publicly-accessible full text available May 19, 2026
  5. Theθ-temperature of linear, ring, and poly[n]catenane polymers as characterized by varying LJ cutoff,rcvalues. Higherrcvalues correspond to lowerθ-temperature. 
    more » « less
  6. Bacterial ice nucleating proteins (INPs) are exceptionally effective in promoting the kinetically hindered transition of water to ice. Their efficiency relies on the assembly of INPs into large functional aggregates, with the size of ice nucleation sites determining activity. Experimental freezing spectra have revealed two distinct, defined aggregate sizes, typically classified as class A and C ice nucleators (INs). Despite the importance of INPs and years of extensive research, the precise number of INPs forming the two aggregate classes, and their assembly mechanism have remained enigmatic. Here, we report that bacterial ice nucleation activity emerges from more than two prevailing aggregate species and identify the specific number of INPs responsible for distinct crystallization temperatures. We find that INP dimers constitute class C INs, tetramers class B INs, and hexamers and larger multimers are responsible for the most efficient class A activity. We propose a hierarchical assembly mechanism based on tyrosine interactions for dimers, and electrostatic interactions between INP dimers to produce larger aggregates. This assembly is membrane-assisted: Increasing the bacterial outer membrane fluidity decreases the population of the larger aggregates, while preserving the dimers. Inversely, Dulbecco’s Phosphate-Buffered Saline buffer increases the population of multimeric class A and B aggregates 200-fold and endows the bacteria with enhanced stability toward repeated freeze-thaw cycles. Our analysis suggests that the enhancement results from the better alignment of dimers in the negatively charged outer membrane, due to screening of their electrostatic repulsion. This demonstrates significant enhancement of the most potent bacterial INs. 
    more » « less
  7. We present a generalization of the geometric phase to pure and thermal states in $$\mathcal{PT}$$-symmetric quantum mechanics (PTQM) based on the approach of the interferometric geometric phase (IGP). The formalism first introduces the parallel-transport conditions of quantum states and reveals two geometric phases, $$\theta^1$$ and $$\theta^2$$, for pure states in PTQM according to the states under parallel-transport. Due to the non-Hermitian Hamiltonian in PTQM, $$\theta^1$$ is complex and $$\theta^2$$ is its real part. The imaginary part of $$\theta^1$$ plays an important role when we generalize the IGP to thermal states in PTQM. The generalized IGP modifies the thermal distribution of a thermal state, thereby introducing effective temperatures. \textcolor{red}{At certain critical points, the generalized IGP may exhibit discrete jumps at finite temperatures, signaling a geometric phase transition. We illustrate the IGP of PTQM through two examples and compare their differences}. 
    more » « less